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A METHOD AND APPARATUS FOR MEMORY 
MANAGEMENT OF VIDEO IMAGES 


FIELD OF THE INVENTION 


The present invention relates to video images, and, in particular, to a novel storage 
format for enabling improved memory management of video images. 


In accordance with the NTSC (National Television Standards Committee) and 
PAL (Phase Alternating Line) standard, video images are presented in the YUV color 
space. The Y signal represents a luminance value while the U and V signals represent 

10 color difference or chrominance values. YUV video image data may be transmitted in 

packed format or planar format. In packed format, all the data for a given set of pixels of 
the video image is transmitted before any data for another set of pixels is transmitted. As 
a result, in packed format, YUV data is interleaved in the transmitted pixel data stream 
100, as depicted in FIG. 1 (YUY2 packed format). In planar format, Y, U and V data 

15 values are stored into separate Y, U and V memory areas (planes) in system memory 110, 
as depicted in FIG. IB. 

FIGS. 2A, 2B and 2C are diagrams illustrating three different formats for 
representing video images in the YUV color space. A video image frame may consist of 
three rectangular matrices representing the luminance Y and the two-chrominance values 

20 U and V.^Y matrices 120, 130 and 140 have an even number of rows and columns. In 
YUV 4:2:0 color space format, chrominance component matrices 122 and 124 may be 
one half in size of Y matrix 120 in horizontal and vertical directions as depicted in FIG. 
2A. In YUV 4:2:2 format, chrominance component matrices 132 and 134 may be one 
half in size of Y matrix 130 in the horizontal direction and the same size in the vertical 

25 direction as depicted in FIG. 2B. Finally, in YUV 4:4:4 format, chrominance component 
matrices 142 and 144 may be the same size as Y matrix 140 in the horizontal and vertical 
directions as depicted in FIG. 2C. 


5 


BACKGROUND OF THE INVENTION 
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To store video data efficiently, conventional digital video systems contain a data 
compressor that compresses the video image data using compression techniques. Many 
conventional compression techniques are based on compressing the video image data by 
processing the different pixel components separately. For example, in accordance with 
5 Motion Picture Experts Group (MPEG) or International Telecommunications Union 
(ITU) video compression standards, a YUV-data compressor may encode the Y data 
independently of encoding U data and encoding V data. Such a compressor preferable 
receives video data in planar format, in which the Y, U, and V data for multiple pixels are 
separated and grouped together in three distinct data streams of Y only, U only and V 

10 only data, as described above (FIG. IB). 

Although planar format provides significant advantages for data compression, 
several disadvantages arise when storing or processing data received in planar format. 
For example, a video decoder that receives video image data in YUV planar format 
requires three pointers to the Y, U and V component values. For basic DVD (Digital 

15 Versatile Disk) and HDTV (High Definition Television) mode, each macroblock has 

three blocks of pixels: Y: 16x16, U:8x8 and V:8x8. In addition, the U and V components 
are located in different memory locations. In terms of code size, three blocks of code are 
required for conventional motion compensation of the video image data. Moreover, a 
separate memory page usually must be opened for each YUV component. 

20 In terms of cache efficiency, for YUV video in the 4:2:0 format (FIG. 2A), the 

useful area (in a cache line) for the Y-component is about sixteen-bytes per cache line. 
For the U and V components, the useful area is eight-bytes per line per color-component. 
Therefore, two rows in a macroblock potentially occupy four cache lines since the U and 
V components are vertically and horizontally sub-sampled in 4:2:0 format (two Y cache 

25 lines, one U cache line, and one V cache line). For YUV video in 4:2:2 format (FIG. 2B), 
six cache lines are required (2 Y cache lines, 2 U cache lines, and 2 V cache lines.) 
Although YUY2 packed format (FIG. 1 A), as described above, uses only two cache lines 
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and could be used to overcome this cache inefficiency problem, conventional motion 
compensation of data in YUY2 format is inefficient. 

Therefore, there remains a need to overcome the limitations in the above 
described existing art, which is satisfied by the inventive structure and method described 
hereinafter. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Additional features and advantages of the invention will be more apparent from 
the following detailed description and appended claims when taken in conjunction with 
the drawings, in which: 

5 FIG. 1A illustrates a video image data stream in YUV2 packed format as known 

in the art. 

FIG. IB illustrates video image data stored in YV12 pure planar format, as 
known in the art. 

FIGS. 2A-C illustrate the YUV 4:2:0 color space format, the YUV 4:2:2 color 
10 space format and the YUV 4:4:4 color space format, as known in the art. 

FIG. 3 illustrates a block diagram of a conventional computer system as known in 

the art. 

FIG. 4 is a block diagram illustrating a video decoder in accordance with one 
embodiment of the present invention. 
15 FIGS. 5 A and 5B illustrate YUV color space storage formats as known in the art. 

FIG. 5C illustrates a block diagram of a mixed storage format according to an 
exemplary embodiment of the present invention. 

FIGS. 6A-6B illustrate motion compensation of decoded blocks according to a 
further embodiment of the present invention. 
20 FIG. 7A is a block diagram illustrating steps for encoding and decoding MPEG 

image data as known in the art. 

FIG. 7B is a block diagram illustrating a video encoder in accordance with a 
further embodiment of the invention. 

FIG. 8 is a block diagram illustrating a write combining (WC) buffer as known in 

25 the art. 

FIG. 9 illustrates method steps for decoding an encoded MPEG video bit stream 
as known in the art. 
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FIG, 10 illustrates method steps for decoding an encoded MPEG video bit stream 
in accordance with the novel motion compensation technique as taught by the present 
invention. 

FIG. 11 A illustrates a stride of an image frame represented in the mixed storage 
format as taught by the present invention. 

FIG, 1 IB is a block diagram illustrating a write combining (WC) buffer including 
a cache line containing a Y-component and a UV component represented in the mixed 
storage format as taught by the present invention. 

FIG. 12 illustrates method steps for improved memory management of video 
images utilizing a mixed storage format according to an embodiment of the present 
invention. 

FIG. 13 illustrates additional method steps for improved memory management of 
video images utilizing a mixed storage format according to a further embodiment of the 
present invention. 

FIG. 14 illustrates additional method steps for improved memory management of 
video images utilizing a mixed storage format according to a further embodiment of the 
present invention. 

FIG. 15 illustrates additional method steps for improved memory management of 
video images utilizing a mixed storage format according to a further embodiment of the 
present invention. 

FIG. 16 illustrates method steps for decoding an encoded bit stream according to 
an embodiment of the present invention. 

FIG. 17 illustrates additional method steps for decoding an encoded bit stream 
according to a further embodiment of the present invention. 

FIG. 18 illustrates additional method steps for decoding an encoded bit stream 
utilizing a mixed storage format according to a further embodiment of the present 
invention. 
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DETAILED DESCRIPTION 

System Architecture 

The present invention overcomes the problems in the existing art described above 
by providing a novel storage format enabling a method for improved memory 
5 management of video images. In the following detailed description, numerous specific 
details are set forth in order to provide a thorough understanding of the present invention. 
However, one having ordinary skill in the art should recognize that the invention may be 
practiced without these specific details. In addition, the following description provides 
examples, and the accompanying drawings show various examples for the purposes of 
10 illustration. However, these examples should not be construed in a limiting sense as they 
are merely intended to provide examples of the present invention rather than to provide 
an exhaustive list of all possible implementations of the present invention. In some 
instances, well-known structures, devices, and techniques have not been shown in detail 
to avoid obscuring the present invention. 
15 Referring to FIG. 3, a block diagram illustrating major components of a computer 

system 200 in which the inventive storage format may be implemented is now described. 
The computer system 200 includes a display controller 220. The display controller 220 
is, for example a Video Graphics Adapter (VGA), Super VGA (SVGA) or the like. 
Display controller 120 generates pixel data for display 290, which is, for example, a CRT, 
20 flat panel display or the like. The pixel data is generated at a rate characteristic of the 
refresh of display 290 (e.g., 60 Hz, 72Hz, 75Hz or the like) and horizontal and vertical 
resolution of a display image (e.g., 640x480 pixels, 1024x768 pixels, 800x600 or the 
like). Display controller 220 may generate a continuous stream of pixel data at the 

characteristic rate of display 290. 

Display controller 220 is ai^p provided with a display memory 230, which stores 
pixel data in text, graphics, or video mbdes for output to display 290. Host CPU 210 is 
be coupled to display controller 220 throughsbus 270 and updates the content of display 
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memory 230 when a display image for display 290 is altered. Bus 270 may comprise, for 
example, a PCI bus or the lik\ System memory 280 may be coupled to Host CPU 210 for 
storing data. Hardware video deboder 240 is provided to decode video data such as, for 
example, MPEG video data. MPEGVideo data is received from an MPEG video data 
source (e.g., CD-ROM or the like). Alternatively, the video decoder 240 is implemented 
as, for example, a convention software decoder 282 stored in the system memory 280. As 
such, one of ordinary skill in the art will recognize that the teaching of the present 
invention may be implemented in either software or hardware video decoders. Once 
decoded, the decoded video data is outputted to system^memory 270 or directly to display 
memory 230^ 


71 


15 


20 


25 


Referring to FIG. 4, the components of the video decoder 240 according to a first 
embodiment of present invention are further described. MPEG data received from an 
MPEG data source rnhy be decoded and decompressed as follows. The video decoder 
240 receives an MPEG Kit stream 242 at a Variable Length Decoding (VLD) block 244. 
The VLD block 244 decoded the MPEG bit stream 242 and generates a quanitized block 
246 that is transferred to an Inverse Quantanization Block (IQ block) 266. The IQ block 
266 performs inverse quantization^! the quantized block 246 to generate a frequency 
spectrum 268 for the quantized blockV An Inverse Discrete Cosine Transform (IDCT) 
block 246. performs inverse discrete cosVe transformation of the quantized block 246 
using the frequency spectrum 268 to generate a decoded block 252 in YUV planar format 
that is transferred to the motion compensation block (MCB) 248. Motion compensation 
is performed by the MCB 248 to recreate the MPEG data 256 in YUV planar format. 
Color conversion block 262 converts the MPEG data^256 from YUV color space format 
to the Red, Green, Blue (RGB) color space in order to generate pictures 264. 

Conventional MPEG decoders, such hardware videovdecoder 240 or software 
video decoder 282, decode a compressed MPEG bit stream into^a storage format 
depending on the particular compression format used to encode tnk MPEG bit stream. 
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For the re^ons described above, YUV planar format is the preferred format for 
/r \^ compression oKMPEG images. Consequently, the decoded block 252 outputted by the 
IDCT block 250 a^well as the MPEG data 256 outputted by the MCB 254 are generated 
in YUV planar format\lJnfortunately, YUV planar format is an inefficient format during 
5 motion compensation of tn& decoded block 252. 

FIG. 5C depicts the noVel mixed storage format 300 described by the present 
invention. Careful review of FIGSv 5 A-5C illustrates that Y component values are stored 
in a planar array 300A while the U an\v components are interleaved in a packed array 
300B. Using the mixed storage format 30& decoded block 252 received from the IDCT 
10 block 246 is converted from planar format (FIG. 5B) to the mixed storage format 300. 
Storage of reference frames 260 and MPEG data^56 in the mixed storage format 300 
optimizes motion compensation of the decoded block^52 as depicted in FIGS. 6A and 
6B. 


FIG. 6A depicts a previous frame 260 that is stored in a reference frames block 
15 258. The previous frame 260 includes a 320x280 pixel Y-component 310, a 160x140 
pixel U-component 312 and a 160x140 pixel V-component 314, represented in planar 
format 304 (FIG. 5B). FIG. 6B depicts a portion of the Video Decoder 240. Together 
FIGS. 6A and 6B depict the combination of the decoded block 252 (252A, 252B, and 
252C) and corresponding YUV components, as determined by a motion vector (V) 248, 
20 of the previous frame 260 during motion compensation. During the decoding process, the 
VLD block 244 generates the motion vector (V) 248, which corresponds to the decoded 
block 252 generated by the IDCT Block 250. Referring again to FIGS. 6A and 6B, the 
motion vector (V) 248 identifies Y-block 316, U-block 318 and V-block 320 of the 
previous frame 260. In order to motion compensate the decoded block 252, Y data 252A 
25 is combined with Y-block 316, U data is combined with U-block 318 and V data is 
combined with V-block 320 in order to generate MPEG data 256 (322A, 322B and 
322C). 
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However, if the previous frame 260 and the decoded block 252 are stored in the 
mixed storage format 300 (FIG. 5C), the motion compensation process is streamlined. 
Referring again to FIG. 6A, the previous frame can be stored in the mixed storage format 
300 (FIG. 5C) as indicated by the 320x140 UV-component 330, such that a UV-block 
5 335 is formed. Referring again to FIG. 6B, the decoded block can be stored in the mixed 
storage format 300 (FIG. 5B) as indicated by UV data 340. The UV-block 335 is then 
combined with the UV data 340 in order to generate UV-MPEG data 350. 

Careful review of FIGS. 6 A and 6B illustrates the advantages of using the mixed 
packed format 300 (FIG. 5C) during motion compensation. Motion compensation using 

10 the planar format 304 (FIG. 5B) requires (1) opening three memory pages, (2) using three 
pointers, and (3) performing three operations for each YUV component to motion 
compensate the decoded block 252. In contrast, motion compensation using the mixed 
storage format 300 (FIG. 5C) requires (1) opening two memory pages, (2) using two 
pointers, and (3) performing two operations for each Y and UV component to motion 

15 compensate the decoded block 252. In addition, referring again to FIG. 6A and 6B, 
storage of YUV data in cache memory (not shown) requires thirty-two cache lines in 
planar format 304 (FIG. 5B) (eight-cache lines for the Y-component, eight cache lines for 
each U and V component). In contrast, storage of the Y and UV components in the 
mixed storage format 300 (FIG. 5C) requires twenty-four cache lines (sixteen cache lines 

20 for the Y-component and eight cache lines for the UV component). 

Moreover, benefits from using of the mixed storage format 300 (FIG. 5C) are not 
limited to video decoders. FIG. 7A depicts the steps for encoding and decoding video 
images 400. Careful review of FIG. 7A illustrates that the encoding 402 and decoding 
420 of video images essentially mirror each other. Consequently, referring to FIG. 7B, 

25 the mixed storage format may be used in a Video Encoder 450 during motion estimation 
452, motion compensation 454 and storage of reference frames 456. For the reasons 
described above use of the mixed storage format 300 (FIG. 5C) will reduce hardware 
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costs of the Video encoder (less pointers), and speed up the encoding process (less 
operations and memory access). In addition, cache line efficiency results from using the 
mixed storage format 300 (FIG. 5C), as described above. 

Although use of the mixed storage format 300 (FIG. 5C) provides improved cache 
5 line efficiency, there remains a need to improve access speed to graphics memory. One 
technique for overcoming this problem is using a Write Combining (WC) buffer in order 
to accelerate writes to the video frame buffer, as depicted in FIG. 8. FIG. 8 depicts a 
memory address space 500 including a WC region 502. The WC buffer 512 utilizes the 
fact that thirty-two byte burst writes are faster than individual byte or word writes since 

10 burst writes consume less bandwidth from the system bus. Hence, applications can write 
thirty-two bytes of data to the WC frame buffer 512 before burst writing the data to its 
final destination 508. 

Nonetheless, not all applications take advantage of WC buffers. One problem 
associated with WC buffers is that WC buffers generally contain only four or six entries 

15 in their WC region 502. Consequently, any memory stores to an address that is not 

included in the current WC buffer 512 will flush out one or some entry in the WC buffer 
512. As a result, partial writes will occur and reduce the system performance. 

However, by writing data sequentially and consecutively into the WC region 502 
of the system memory 500, partial memory writes to the WC region 502 are eliminated. 

20 Referring again to FIG. 8, if we write the first pixel 504 in line one of an image, then the 
first pixel 506 in line two of the image, it is very likely that the WC buffer 512 (holding 
only one byte) will be flushed out. This occurs because we are writing to the WC region 
502 in a vertical manner, such that the second write does not map to the same entry of the 
WC buffer 512. In contrast, when we write to the WC region 502 in a sequential and 

25 consecutive manner, the first thirty-two pixels 508 of line one of the image may be 
written to the WC buffer 512 before the first thirty-two bytes of pixels 508 are burst 
written to their final destination in the WC region 502. Once we completely write the 


Express Mail No.: EM560648096US 


10 


042390.P9148 



thirty-second pixel byte 508 in the WC buffer 512, the entire thirty-two bytes of pixels 
508 can be burst written to their final destination. 

In MPEG video decoding, it is important to reduce partial writes during motion 
compensation and during frame output to graphics memory. A novel method for re- 
5 ordering of operations during motion compensation of video frames in order to reduce 
partial writes is now described with reference to FIG. 9 and FIG. 10. 

FIG. 9 is a block diagram depicting steps for decoding an MPEG bit stream. 
Instead of motion compensating a block after a block, we propose motion compensation 
of blocks in groups of four block (as shown in FIG. 10). MPEG video bits streams are 

10 generally decoded as follows: (1) VLD the motion vector and the DCT coefficients of a 
block 522; (2) IQ and IDCT of the DCT coefficients of the block 524; and (3) MC the 
residue of the block with the displaced reference block 526. One problem of this 
approach is that it causes many partial writes. 

Generally, after a block is MC, the resulting block is written back to the frame 

15 buffer. Assuming the video image is stored in a linear fashion and the width of the video 
is larger than the size of an entry in WC buffer, the resulting marcoblock is written to the 
frame as follows. As the block is written back to the frame buffer line after line, each 
eight-byte write starts a new cache line. Thus, after storing four lines, the application is 
forcing the WC buffer to flush some of its entries. That is, partials writes (16 bytes out of 

20 32 bytes) occur. 

However by motion compensating four blocks together, partial writes are 
eliminated. That is, Step one (522) and Step two (524) are repeated four times before 
Step three (526) is performed as depicted in FIG. 10. Furthermore, instead of writing out 
the second line of a first block after the first line of the first block, the first line of a 

25 second block is written out. This is because the first line of the first block and the first 
line of the second block belong to the same cache lines. Consequently, the WC buffer 
500 can easily combine the write operations in a burst operation. However, those skilled 
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in the art will recognize that various numbers of blocks may be chose as the plurality of 
block, such the number of blocks is dependent on the line size of the write combining 
buffer. 

The real advantages of writing the data out in a multiple of four blocks comes 
5 from using the mixed storage format 300 (FIG. 5C). In this format that, Y components of 
a video image have the same stride as the UV components of the video image. Referring 
to FIG. 1 1 A, an image frame 550 is depicted utilizing the mixed storage format 300. The 
image 550 includes a 320x280 pixel Y-component 552 and a 320x140 pixel UV- 
component 554. As such a Y-block 556 has an equal stride 560 as a UV-block 558. As a 

10 result, whenever we finish writing to a full cache line 572 (thirty-two bytes) of a WC 

buffer 570 (FIG. 11B), we generate a "full write" of Y-components 574 (thirty-two bytes) 
and corresponding UV components 576 (thirty-two bytes). We only need four blocks to 
guarantee a "full-write" of the cache line. That is, we don't need four extra blocks to 
guarantee a "full-write" of the cache line. This property provides a distinct advantage 

15 over previous pure planner YV12 format. Procedural method steps for implementing the 
inventive mixed storage format 300 (FIG. 5C) and a modified method for decoding an 
encoded bit stream are now described. 

Operation 

20 Referring now to FIG. 12, a method 600 is depicted for improved memory 

management of image data, for example, in the Video Decoder 240 as depicted in FIG. 4. 
At step 602, an image consisting of a plurality of color components 242 is received. The 
plurality of color components 242 contained in the image are received in a planar format 
304 (FIG. 5B). At step 604, the each of the plurality of color components 242 is 

25 converted into a mixed format 300 (FIG. 5C) of planar format 304 and packed format 

302. In order to convert the plurality of color components to the mixed format 300, one 
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or more of the plurality of color components 242 are stored in a planar format 304 and 
one or more of the plurality of color components are stored in a packed format 302. 

FIG. 13 depicts additional method steps 610 for converting the plurality of color 
components 242 of step 604, such that the plurality of color components are presented in 
5 a YUV color space. At step 612, luminance components (Y) of the image are stored in a 
planar array 300A (FIG. 5C). At step 614, chrominance components (UV) of the image 
are stored in a packed array 300B (FIG. 5C). The image is, for example, a video image 
consisting of a plurality of pixels. 

FIG. 14 depicts additional method steps 620 for converting color components 242 

10 in the mixed format 300 into the packed format 302. At step 622, the luminance 

components (Y) are accessed within the planar array 300A. At step 624, the chominance 
components (UV) are accessed within the packed array 300B. Finally at step 626, an 
interleaving write-out of Y components and UV components is performed to a YUV 
packed array 302. The packed format is, for example, one of YUV2 packed format, 

15 UYVY packed format, YUV 12 packed format, YUV 16 packed format, or YUV9 packed 
format. 

FIG. 15 depicts additional method steps 640 for converting color components 242 
stored in the mixed format 300 into the planar format 304. A step 642, a memory copy is 
performed of the luminance components (Y) within the planar array 300A to a Y-plane 

20 304A of YUV planar arrays 304. At step 644, an alternate read/write-out coping of the 
UV components is performed into respective planes of the YUV planar arrays 304. In 
other words, the U components are written to a U-plane 304B of the planar arrays 304 
and the V component is written to a V-plane 304B of the planar arrays 304. The planar 
format is, for example, chosen as one of YV12 planar format, YUV 12 planar format, 

25 YUV 16 planar format, or YUV9 planar format. In addition, color components are 

presented in a color space chosen as, for example, one of a YUV color space, a YCrCb 
color space, a YIQ color space, or an RGB color space. 
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Referring now to FIG. 16, a method 700 is depicted for decoding an encoded bit 
stream, for example, in the Video Decoder 240 as depicted in FIG. 4. At step 704, a 
portion of the encoded bit stream is received representing an encoded block. 
Alternatively, a quanitized block 246 may be received. At step 706, the encoded block is 
variable length decoded (VLD) to generate a quantized block. When the quanitized block 
is received at step 704, step 706 is not performed. Those skilled in the art will appreciate 
that the encoded block may be decoded in various ways and remain with in the scope of 
this invention. At step 708, inverse quantization (IQ) is performed on the quantized block 
to generate a frequency spectrum for the quantized block. At step 710, inverse discrete 
cosine transformation (IDCT) of the quantized block is performed using the frequency 
spectrum to generate a decoded block. At step 712, steps 704 through 710 are repeated 
for a plurality of encoded blocks. As a result, a plurality of decoded blocks, representing 
a plurality of macroblocks, are formed. At step 714, the plurality of macroblocks are 
motion compensated as a group in order to generate a plurality of motion compensated 
(MC) macroblocks. Finally at step 740, steps 704 through 714 are repeated for each 
encoded block represented by the encoded bit stream. The encoded bit streams is, for 
example, an encoded MPEG video bit stream. 

FIG. 17 depicts additional method steps 716 for motion compensating the 
plurality of macroblocks of step 714. At step 718, four blocks are used as the plurality of 
blocks. Finally at step 720, pixel data of the four MC blocks is written as a group and in 
a sequential manner to a frame buffer, such that prior to being burst written to the frame 
buffer, the pixel data is temporarily held in an entry of a write-combining (WC) buffer 
(the first line of a second block is written out after the first line of the first block). As a 
result, partial writes from the WC buffer are eliminated. The WC buffer is, for example, 
a four-cache line WC buffer having a thirty-two byte cache line length, however, 
alternate numbers of cache lines and cache line lengths are within the scope of the present 
invention. 
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FIG. 18 depicts additional method steps 730 for motion compensating the 
plurality of macroblocks of step 714, such that the decoded blocks are represented in a 
YUV color space, planar storage format. At step 732, luminance components (Y) of the 
decoded blocks are stored in a planar array. Finally at step 734, chrominance components 
5 (UV) of the decoded blocks are stored in a packed array. In other words, the decoded 
blocks are converted into a mixed storage format of planar format and packed format. 

The methods described above can be provided in applications (e.g., video/graphic 
applications) to potentially increase the performance of the applications by decreasing the 
time to motion compensate decoded blocks. Moreover, applications that include the 

10 method as described above, can be stored in memory of a computer system as a set of 
instructions to be executed. In addition, the instructions to perform the methods as 
described above could alternatively be stored on other forms of computer-readable 
medium, including magnetic and optical disks. For example, method of the present 
invention can be stored on computer-readable mediums, such as magnetic disks or optical 

15 disks, that are accessible via a disk drive (or computer-readable medium drive). 

Alternatively, the logic to perform the methods as discussed above, could be 
implemented in additional computer and/or machine readable mediums. Such mediums 
include discrete hardware components such as large-scale integrated circuits (LSI's), 
application-specific integrated circuits (ASIC's), firmware such as electrically erasable' 

20 programmable read-only memory (EEPROM's); and, electrical, optical, acoustical or 
other forms of propagated signals (e.g., carrier waves, infrared signals, digital signals, 
etc.) 

It is to be understood that even though numerous characteristics and advantages of 
various embodiments of the present invention have been set forth in the foregoing 
25 description, together with details of the structure and function of various embodiment of 
the invention, this disclosure is illustrative only. Changes may be made in detail, 
especially matters of structure and management of parts within the principles of the 
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present invention to the full extent indicated by the broad general meaning of the terms in 
which the appended claims are expressed. For example, the particular element may vary 
depending on the particular application for the novel mixed storage format while 
maintaining substantially the same functionality without departing from the scope and 


In addition, although the preferred embodiment described herein is directed to a 
novel mixed storage format for enabling improved memory management of video images, 
it will be appreciated by those skilled in the art that the teaching of the present invention 
can be applied to other system. In fact, systems from low-bit-rate Internet based 

10 communications to high-bit-rate video broadcasting as well as software/hardware video 
encoder/decoders are within the teachings of the present invention, without departing 
from the scope and spirit of the present invention. 

Having disclosed exemplary embodiments and the best mode, modifications and 
variations may be made to the disclosed embodiments while remaining within the scope 

15 of the invention as defined by the following claims. 


5 spirit of the present invention. 
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